COMBINATORIAL BIOLOGY

The present application claims priority to United States Provisional Application USSN 
60/114,909, filed January 5, 1999, the entire contents of which are incorporated herein by
reference. 

Background of the Invention 

The ability of nature to produce small molecules having both structural complexity and 
biological potency has led to the development of countless therapeutic agents such as
taxol, penicillin, and quinine, to name a few. Specific organisms found in nature have developed 
the ability to produce secondary metabolites in response to the challenges and needs that are 
encountered in their particular environment. In particular, through the random recombination 
and mutation of existing genetic material, the organism is able, in a combinatorial sense, to 
generate new biosynthetic enzymes that catalyze the assembly of new organic compounds (see
Verdine, "The Combinatorial Chemistry of Nature", Nature, 1996, 384, 11).

Clearly, the ability to generate a diverse array of compounds reminiscent of these natural 
products would be desirable to increase the arsenal of small molecules available for
testing and use as therapeutic agents. Just as many scientists involved in the discovery and 
isolation of new natural products sample organisms from many different environments such as 
coral reefs, deep-sea hydrothermal vents, and tropical rainforests in their quest for structural 
diversity, both biologists and chemists have been searching for new ways to achieve this 
structural diversity by manipulating genetic material directly, or by generating novel synthetic 
pathways, respectively. 

For example, certain researchers have attempted to alter the catalytic capability of
synthetic enzymes that naturally produce interesting biologically active compounds by making
specific changes to genes encoding particular enzymes (see, for example, Cortes et al., Science
268:1487, 1995; Kao et al., J. Am. Chem. Soc. 117:9105, 1995; Donadio et al., Science 252:675,
1991; WO 93/1363; U.S. Patent Number 5,824,513; WO 98/49315; U.S. Patent Number
5,652,116; U.S. Patent Number 5,824,774; WO 98/51695; U.S. Patent Number 5,795,738; and
WO 98/01546). The hope is that, by modifying the gene encoding the synthetic enzyme, the
researchers will be able to generate new enzymes with altered synthetic characteristics, which
new enzymes will in turn generate new chemical compounds that are related to those produced
by the naturally-occurring enzymes, and therefore are likely to have similar desirable biological
activities.

Concurrently, several groups have also been interested in generating combinatorial
libraries of compounds synthesized using novel synthetic methods. Specifically, in many cases,
researchers have developed "biased" libraries, in which all members share a particular
characteristic, such as an ability to interact with a particular target ligand, or a characteristic
structural feature designed to mimic a particular aspect of a class of natural compounds. For
example, a number of libraries have been designed to mimic one or more features of natural
peptides. Such "peptidomimetic" libraries include phthalimido libraries (WO 97/22594),
thiophene libraries (WO 97/40034), benzodiazepine libraries (US 5,288,514), libraries formed
by the sequential reaction of dienes (WO 96/03424), thiazolidinone libraries, libraries of
metathiazanones and their derivatives (US 5,549,974), and azatide libraries (WO 97/35199) (for
review of peptidomimetic technologies, see Gante, J., Angew. Chem. Int. Ed. Engl. 1994, 33,
1699-1720 and references cited therein). Each of these libraries has provided solid phase
synthetic strategies for compounds possessing specific core functionalities, but none achieves the
complexity of structure found in natural products, or in other lead compounds prepared through
traditional chemical synthetic routes. Complex natural products commonly contain several
different functionalities and often are rich in stereochemical complexity. Such diversity and
complexity is difficult to achieve if the synthesis is restricted to compounds containing a specific
core functionality.

Clearly, there remains a need to develop an efficient and powerful system for the 
generation of large numbers of compounds having unprecedented stereochemical, structural, 
topological and functional diversity. The present invention provides a unique method in which 
both the tools provided by nature and modern organic synthesis can be utilized. 



Summary of the Invention 
The present invention provides a method for merging combinatorial biosynthesis
incorporated above with techniques of synthetic organic chemistry. In general, this method,
combinatorial biology, involves 1) providing "starter units", wherein the starter units are capable
of being accepted by the modular biosynthetic enzymatic machinery, and have incorporated
therein a "functional handle" capable of reacting with specific functionality present on a solid
support; 2) feeding these "starter units" into the modular biosynthetic enzymatic machinery, in
vivo or in vitro, to obtain complex template molecules; and 3) further functionalizing the
complex template molecules using synthetic organic chemistry to provide a collection of
complex "unnatural" natural products having structural, topological, stereochemical and
functional diversity.

In a preferred embodiment, the starter units are attached to solid support units prior to
feeding the starter units into the modular biosynthetic enzymatic machinery, and thus the support
bound starter units are fed into the biosynthetic enzymatic machinery to generate a collection of
complex template structures. These template structures thus generated can then be further
functionalized using synthetic organic chemistry, or can be further functionalized using any
combination of synthetic organic chemistry and reintroduction into the biosynthetic enzymatic
machinery.

In another preferred embodiment, the starter units are fed to the modular biosynthetic
enzymatic machinery prior to being attached to the solid support, and thus template structures
are generated. In particularly preferred embodiments specific functionalities can also be
incorporated into the template structures via the original starter unit capable of being recognized
by an antibody and then purified. Alternatively or additionally, these templates can then be
attached to solid support units and further functionalized using synthetic organic chemistry or
any combination of synthetic organic chemistry and the biosynthetic enzymatic machinery.

Description of the Drawing

Figure 1 depicts the biosynthesis of several erythromycin derivatives and several
different starter units.
Figure 2 depicts potential starter units with functional handles classified by the
biosynthetic pathway.

Figure 3 depicts the functionalization of an iodoaryl compound to generate an 
arylacetylene via Sonogashira/Castro-Stephens coupling reaction.

Figure 4 depicts certain preferred embodiments of the method of the present invention. 
Figure 5 depicts specific complex template structures capable of further diversification
using synthetic organic chemistry or biosynthetic pathways.







Detailed Description of the Invention 

Recognizing the desirability of utilizing both the efficient and powerful methods of
natural products biosynthesis and the diverse repertoire of reactions available in synthetic
organic chemistry, a method for merging combinatorial biosynthesis incorporated above with
techniques of synthetic organic chemistry is provided. In general, this method, combinatorial
biology, involves 1) providing "starter units", wherein the starter units are capable of being
accepted by the modular biosynthetic enzymatic machinery, and have incorporated therein a
"functional handle" capable of reacting with specific functionality present on a solid support; 2)
feeding these "starter units" into the modular biosynthetic enzymatic machinery, in vivo or in
vitro, to obtain complex template molecules; and 3) further functionalizing the complex template
molecules using synthetic organic chemistry to provide a collection of complex "unnatural"
natural products having structural, topological, stereochemical and functional diversity. As used
herein, the term "starter unit" comprises any compound that can be incorporated into the
biosynthetic pathway. For example, certain biosynthetic enzymes, such as polyketide synthases,
utilize two different classes of "starter units", specifically "initiator" molecules and "extender"
molecules, typically acetates or propionates. For the purposes of the present invention, either
category, "initiator" or "extender", may be utilized as a "starter" molecule.

As one of ordinary skill in the art will realize, because the starter units have incorporated
therein a functional handle, they, or any of the products generated from these starter units, are
capable of being attached to solid support units at any stage in the combinatorial biosynthetic
pathway. In one preferred embodiment, the starter units are attached to solid support units prior
to feeding the starter units into the modular biosynthetic enzymatic machinery, and thus the
support bound starter units are fed into the biosynthetic enzymatic machinery to generate a
collection of complex template structures. These template structures thus generated can then be
further functionalized using synthetic organic chemistry, or can be further functionalized using
any combination of synthetic organic chemistry and reintroduction into the biosynthetic
enzymatic machinery. In another preferred embodiment, the starter units are fed to the modular
biosynthetic enzymatic machinery prior to being attached to the solid support, and thus template
structures are generated. In particularly preferred embodiments specific functionalities can also
be incorporated into the template structures via the original starter unit capable of being
recognized by an antibody and then purified. Alternatively or additionally, these templates can
then be attached to solid support units and further functionalized using synthetic organic
chemistry or any combination of synthetic organic chemistry and the biosynthetic enzymatic
machinery.

Thus, the present invention represents a broadening of the concept of combinatorial
biosynthesis to incorporate the advantages of organic synthetic techniques on the solid phase to
generate increasingly complex "unnatural" natural products. Various characteristics of the
starter units and the reactions utilized in preferred embodiments of the present invention are
discussed in more detail below; certain examples of the method of the present invention are also
presented.



Biosynthetic Enzymatic Machinery 

In principle, the inventive combinatorial biology methods may be applied to any 
biosynthetic pathway in which the synthetic enzymes will accept inventive starter molecules. In 
certain embodiments of the invention, the starter molecules are provided to living cells in which 
the synthetic enzymes are operating, and synthetic reactions in accordance with the present 
invention are performed in vivo. Alternatively, the biosynthetic pathway may be reproduced in
vitro, and the starter molecules may be provided to the synthetic enzymes in that context. 
Preferred biosynthetic pathways to which the inventive technology may be applied include the 
animal fatty acid synthase pathway, the peptide synthetase pathway, and the terpene (or
isoprenoid) synthase pathway. In certain preferred embodiments of the invention, the
naturally-occurring synthetic enzymes are employed and are simply provided
with non-natural starter molecules as described herein.

Alternatively, as mentioned above, various researchers have made modifications to
certain preferred biosynthetic enzymes that alter their catalytic properties (see, for example,
Cortes et al., Science 268:1487, 1995; Kao et al., J. Am. Chem. Soc. 117:9105, 1995; Donadio et
al., Science 252:675, 1991; WO 93/1363; U.S. Patent Number 5,824,513; WO 98/49315; U.S.
Patent Number 5,652,116; U.S. Patent Number 5,824,774; WO 98/51695; U.S. Patent Number
5,795,738; and WO 98/01546). Moreover, United States Patent Application Serial Number
__________, entitled "Improved DNA Cloning", filed on even date herewith and
incorporated herein by reference, describes a powerful system for the production of modified
versions of biosynthetic enzymes, and in particular for the production of libraries of modified
enzymes. Preferred embodiments of the present invention utilize such modified enzymes, and
preferably libraries of modified enzymes, to catalyze synthetic reactions with inventive starter
molecules.

To mention but one particularly preferred embodiment, the system described in the
above-mentioned "Improved DNA Cloning" patent application can be utilized to generate a
library of class I polyketide synthase enzymes in which a particular "AT" domain responsible for
choosing an inventive altered starter molecule has been shuffled to a variety of different
positions in the molecule. Exposure of such an enzyme library to the appropriate collection of
natural and inventive starter and extender molecules will result in incorporation of inventive
molecules at various locations in the polyketide-type compound being synthesized, so that a
wide variety of different polyketide-type molecules can be produced by subsequent
combinatorialization of the compounds generated by the synthetic enzyme.



Starter Units 

As discussed above, the method of the present invention utilizes any combination of
enzymatic machinery that can be employed for specific classes of biosynthetic reaction
pathways. In determining the specific "starter units" that will be utilized in the synthesis,
consideration must also be given to the desired biosynthetic pathway to employ, and thus a
desired family of compounds to be synthesized. For example, specific biosynthetic pathways
utilize starter units known to be
accepted by the modular enzymes to produce a variety of natural products. In but one example,
carboxylic acid building blocks having differing functional moieties such as methyl, ethyl, and
propyl groups are utilized by modular polyketide synthases to yield topologically, functionally
and stereochemically diverse structures such as 6-deoxyerythronolide B and tylactone (see
Khosla, Chem. Rev. 1997, 97, 2577). Similarly, as shown in Figure 1, functionalized thioesters
are utilized to generate a series of derivatives having structures similar to erythromycin (see
Khosla, Chem. Rev. 1997, 97, 2577). As one of ordinary skill in the art will realize, the modular
enzymes that are produced by the shuffling may not result in the ability to predict the specific
starter units that will be incorporated into the enzymatic machinery. Thus, the ability of a
desired or random set of starter units to become attached to the solid support and subsequently
diversified using combinatorial techniques (split-pool synthesis) becomes particularly important
for identifying functional starter units.

Clearly, as mentioned, each of the starter units provided must be capable of being
accepted by the enzymatic "machinery", and thus the functionality and structural topology must be
compatible with the biosynthetic pathway. In but one example, as shown in Figure 2, it would be
desirable for a terpene-based pathway to utilize functionalized derivatives of a farnesyl
pyrophosphate analog. Additionally, for an amino-acid based pathway (peptide synthases), it
would be desirable to utilize functionalized derivatives of amino acids. Figure 2 additionally
provides a collection of suitable "starter units" classified according to their utility in a specific
biosynthetic pathway. Furthermore, these units can themselves be modified at a
stage prior to feeding the units into the enzymatic machinery to produce structures having more
diverse functionalities, or to produce structures having more complex topology. Thus, any
structure may be utilized as a "starter unit", regardless of the complexity and functional diversity
of the compound, provided that the starter unit is capable of being utilized by the enzymatic
pathway.
In addition to the selection of suitable starter units that will ultimately be accepted by the
enzymatic machinery, it is also necessary to ensure that each of the "starter units" that are fed
into the modular biosynthetic enzymatic "machinery" preferably contains a suitable functional
"handle" capable of attachment to functionality present on the solid phase. Figure 2 also depicts
exemplary starter units having alkynes incorporated therein as functional handles. This can be
effected by modifying the starter units that will be utilized in a particular biosynthetic pathway to
incorporate the specific desired functionality for coupling to the solid support. As one of
ordinary skill in the art will realize, attachment to the solid phase can occur prior to feeding the
starter units into the enzymatic machinery, or after feeding the starter units into the enzymatic
machinery to generate complex template structures to be attached to the solid support. Because
attachment can occur prior to or after exposure to the enzymatic machinery, it is particularly
preferred that the functional handle is chemically robust and thus is capable of withstanding any
of the reaction conditions encountered in the enzymatic machinery. For this reason, particularly
robust functionalities such as alkynes are particularly preferred,
although the method of the present invention is not limited to these functionalities.

In an exemplary embodiment, the starter units utilized in the present invention have 
alkyne functionalities incorporated therein as handles for attachment to the solid support. To 
illustrate the incorporation of "handles" into the starter units, Figure 3 depicts the use of the 
Sonogashira/Castro-Stephens reaction, in which unactivated aryl halides react with copper
acetylides to give good yields of arylacetylenes, for the conversion of an aryl iodide to an aryl 
acetylene in the iodo substituted aryl thioester derivative. 

An additional important matter to take into consideration when utilizing a starter unit
having "handles" is the number of "handles" that will ultimately be incorporated into the
complex template as a result of the biosynthesis. For example, if a simpler (less structurally
complex) starter unit is utilized (for example one that is used in successive condensation
reactions, as exemplified in the polyketide synthases) a larger number of handles will be
incorporated. Thus, if one were to incorporate a handle into a starter unit that is utilized in
successive "rounds" of the enzymatic machinery, a template structure would result having many
handles incorporated therein. Thus, to overcome this problem, it is preferable to 1) utilize starter
units that are only incorporated once, as depicted in Figures 2 and 3 by the use of the aryl
functionality, and as may be dictated by the selectivity of specific reactions in the enzymatic
pathway; 2) to incorporate the handle into a limited number of starter units (for example, one or
two units) involved in the synthesis, or 3) to utilize more complex starter units, as depicted in
Figure 2, for the polyketide pathway, and thus the enzymatic machinery does not need to utilize
as many of the starter units and therefore fewer handles will be incorporated. As one of ordinary
skill in the art will realize, when selecting specific starter units it is necessary to take into
consideration the specific enzymatic biosynthetic pathway to be utilized (and the specificity of
incorporation of certain starters) and the number of "handles" desired in the resulting template
structure. Once the "handles" have been incorporated into the "starter units", the starter units
can either be fed directly into the enzymatic machinery or coupled directly to the solid support.
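
The relationship between how often a starter unit is incorporated and how many handles end up in the template can be illustrated with a short calculation. The following Python sketch is offered only as an illustration and is not part of the disclosed method; the unit names, the incorporation counts (one initiator used once, one extender used six times) and the function name `handles_in_template` are hypothetical assumptions chosen to mirror the polyketide example above.

```python
def handles_in_template(incorporations, handle_bearing_units):
    """Count handles in a template, assuming one handle per handle-bearing unit.

    incorporations: mapping of unit name -> number of times the enzymatic
        machinery incorporates that unit into one template (hypothetical values).
    handle_bearing_units: set of unit names that carry a functional handle.
    """
    return sum(
        count
        for unit, count in incorporations.items()
        if unit in handle_bearing_units
    )


# Hypothetical polyketide-type assembly: initiator used once, extender six times.
assembly = {"initiator": 1, "extender": 6}

# Handle placed on the repeatedly used extender: many handles per template.
print(handles_in_template(assembly, {"extender"}))   # 6

# Handle placed on the unit that is incorporated only once: a single handle.
print(handles_in_template(assembly, {"initiator"}))  # 1
```

Under these assumptions, placing the handle on a unit that is incorporated once corresponds to option 1) above, while a handle on a repeatedly used unit yields a template with several attachment sites.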

Whether the starter unit or the template structure is utilized for coupling to the solid
support (which, as one of ordinary skill in the art will realize, also has an alkyne, or other
desirable functionality, bound thereto via a cleavable bond utilizing standard synthetic organic
techniques), the two alkynes incorporated can then be coupled, in a preferred embodiment, using
a Glaser coupling reaction. Specifically, in this reaction, the addition of copper (II) acetate to the
reaction medium effects coupling of the two components to yield a solid support having bound
thereto a starter unit, or a template structure, via a diyne functionality. In other exemplary
embodiments, suitable functional handles which can also be easily incorporated into starter units
and solid support units using standard techniques of synthetic organic chemistry include olefins
and iodoalkenes. Coupling of the components can be effected using olefin metathesis and
Stille coupling reactions, respectively. One of ordinary skill in the art will realize that although
the abovementioned functionalities are particularly preferred, other functionalities that will not
interfere with, or be altered by, the chemistry being employed by the enzymatic modular
machinery may also be utilized.



Combinatorial Bioorganic Synthesis 

Once the selection of desired starting materials is achieved and functional handles are 
incorporated therein for attachment to the solid phase and optionally for purification purposes, 



January 5, 1999
3079836.1 




several variations of the present invention may be employed to obtain "unnatural" natural 
products. Figure 4 refers to preferred embodiments of the present invention and a discussion of 
each of these embodiments with reference to Figure 4 is presented below. 

As shown in Figure 4, genetic manipulation of the modular enzymatic machinery yields
a mutated biosynthetic pathway to be utilized for the generation of novel "unnatural" natural 
products. Once one or more desired mutated biosynthetic pathways are developed, either 
randomly or with a particular target in mind, as discussed above, the tools of synthetic organic 
chemistry can be employed prior to, concurrently with, or after exposure to the enzymatic 
machinery to further diversify and increase the universe of "unnatural" natural products that can 
be generated. Specifically, referring to Figure 4, selected starter units can be fed into this 
mutated biosynthetic pathway in two different ways according to different embodiments of the 
present invention.

In one particularly preferred embodiment, as shown by pathway A, the selected starter
units having the functional handles incorporated therein can be fed directly into the mutated
biosynthetic enzymatic machinery. Thus, after exposure to the enzymatic machinery, a unique
template compound can be obtained and at this stage can be purified, preferably using an
antibody recognition element, or can also be attached to a solid support unit. Subsequently, after
attachment to a solid support unit, the template could be reintroduced into the enzymatic
machinery (same or different) to thus obtain a modified template. Alternatively, and/or
additionally, the template could be utilized in split-pool organic synthesis to generate
combinatorial libraries of complex "unnatural" natural products.

In another particularly preferred embodiment, as shown in pathway B, the selected starter 
units having the functional handles incorporated therein can be attached to the solid support prior 
to being exposed to the biosynthetic enzymatic machinery. Alternatively, or additionally, a 
collection of starter units could be generated using combinatorial synthetic organic chemistry. 
After one or more starter units are fed into the enzymatic machinery, the starter units are 
transformed into a support bound template structure. As Figure 4 depicts, the template structure 
resulting from an initial exposure to a single biosynthetic pathway could be re-exposed to the
same biosynthetic pathway or to another biosynthetic pathway to further modify the structure.
Alternatively, or additionally, the solid support template structure could be diversified using split
and pool techniques and synthetic organic chemistry. As one of ordinary skill in the art will
realize, any combination of the enzymatic machinery and split-pool synthetic organic techniques 
can be employed to increase the structural, functional, topological and stereochemical diversity 
to generate a collection of unique "unnatural" natural products. 

Referring to the embodiments discussed above, either split-pool or parallel synthesis
methods can be employed at each stage of the inventive method to provide a desired collection
of compounds. For example, if the starter unit is not initially attached to the solid support a
parallel synthesis technique is preferably utilized so that the product resulting from the
enzymatic machinery can be identified using spatial encoding methods. Additionally, subsequent
identification of compounds using standard methods such as nuclear magnetic resonance
spectroscopy or mass spectrometry can also be employed to identify specific compounds.
Depending on the size of the library of compounds desired for the synthesis, the identification of
individual compounds may be prohibitively time consuming and therefore a spatial encoding
method may be more particularly preferred.

As discussed above, in one embodiment of the invention, the template structures are
generated in solution, rather than on a solid support unit. In a preferred embodiment, for the
generation of a collection of compounds in solution, a parallel synthesis technique is utilized, in 
which all of the products are assembled separately in their own reaction vessels. In a particularly 
preferred parallel synthesis procedure, a microtitre plate containing n rows and m columns of 
tiny wells which are capable of holding a few milliliters of the solvent in which the reaction will 
occur, is utilized. One of ordinary skill in the art will realize that this particular procedure is 
most useful when smaller libraries are desired, and the specific wells can provide a ready means 
to identify the library members in a particular well. 
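
The spatial encoding provided by a plate of n rows and m columns can be sketched in a few lines of Python. This is an illustration under stated assumptions rather than a description of any particular laboratory protocol: the assignment of starter units to rows and biosynthetic pathways to columns, the placeholder names, and the function `plate_map` are all hypothetical.

```python
import string


def plate_map(row_items, column_items):
    """Map each well label (e.g. 'A1') to the (row item, column item) it holds.

    Rows are lettered A, B, C, ... and columns are numbered from 1, so the
    well label alone identifies the inputs used in that reaction vessel.
    """
    if len(row_items) > len(string.ascii_uppercase):
        raise ValueError("too many rows for single-letter row labels")
    return {
        f"{string.ascii_uppercase[r]}{c + 1}": (row_item, column_item)
        for r, row_item in enumerate(row_items)
        for c, column_item in enumerate(column_items)
    }


# Hypothetical inputs: n = 3 starter units (rows), m = 4 pathways (columns).
starter_units = ["starter_1", "starter_2", "starter_3"]
pathways = ["pathway_1", "pathway_2", "pathway_3", "pathway_4"]

wells = plate_map(starter_units, pathways)
print(len(wells))    # 12 wells, one product per well
print(wells["B3"])   # ('starter_2', 'pathway_3')
```

Because each product is made in its own well, the number of vessels grows as n times m, which is why this layout suits smaller libraries.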

In another more particularly preferred embodiment of the present invention, a solid phase
synthesis technique is utilized for the biosynthesis of the templates and the diversified structures,
or alternatively for the synthesis of the diversified structures produced from template structures
generated in solution as described above. As discussed in detail, the starter units or the template
structures are attached to the solid phase directly or through a linking unit, depending on the stage
of the procedure at which a solid phase synthesis is desired. Advantages of solid phase techniques, most
particularly at the stage when the template structures are functionalized using synthetic organic
techniques, include the ability to more easily conduct multi-step reactions and the ability to drive
reactions to completion because excess reagents can be utilized and the unreacted reagent
washed away. Perhaps one of the most significant advantages of solid phase synthesis is the
ability to use a technique called "split and pool", in addition to the parallel synthesis technique,
developed by Furka (Furka et al., Abstr. 14th Int. Congr. Biochem., Prague, Czechoslovakia,
1988, 5, 47; Furka et al., Int. J. Pept. Protein Res. 1991, 37, 487; Sebestyen et al., Bioorg. Med.
Chem. Lett., 1993, 3, 413). In this technique, a mixture of related compounds can be made in the
same reaction vessel, thus substantially reducing the number of containers required for the
synthesis of very large libraries, such as those containing as many as or more than one million
library members. As an example, the solid support templates or starter units can be divided into
n vessels, where n represents the number of species of reagent A, or the number of different
biosynthetic pathways created by the shuffling procedure, to be reacted with the template
structures or starter units. After reaction, the contents from n vessels are combined and then split
into m vessels, where m represents the number of species of reagent B, or the number of
different biosynthetic pathways created by the shuffling procedure, to be reacted with the
scaffold structures. This procedure is repeated until a desired collection of structures is obtained
to yield a library of "unnatural" natural products.
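
The bookkeeping behind split and pool can be sketched as follows. This Python example is a simplified model offered for illustration only: it assumes every bead reacts completely with the single reagent in its vessel, it deals beads evenly into vessels after each pooling step, and the reagent labels and function names are hypothetical.

```python
import itertools
import random


def enumerate_library(reagent_sets):
    """List every reaction history reachable over the given steps."""
    return list(itertools.product(*reagent_sets))


def vessels_needed(reagent_sets):
    """Split-pool uses one vessel per reagent per step (a sum, not a product)."""
    return sum(len(step) for step in reagent_sets)


def simulate_split_pool(n_beads, reagent_sets, seed=0):
    """Track the reaction history of each bead through split-and-pool rounds."""
    rng = random.Random(seed)
    histories = [[] for _ in range(n_beads)]
    for step in reagent_sets:
        order = list(range(n_beads))
        rng.shuffle(order)  # pooling mixes the beads before the next split
        for position, bead in enumerate(order):
            # Deal the pooled beads round-robin into one vessel per reagent.
            histories[bead].append(step[position % len(step)])
    return [tuple(history) for history in histories]


# Hypothetical reagents: n = 3 species of reagent A, m = 2 species of reagent B.
reagent_sets = [["A1", "A2", "A3"], ["B1", "B2"]]

library = enumerate_library(reagent_sets)
beads = simulate_split_pool(600, reagent_sets)

print(len(library))                  # 6 possible products (n * m)
print(vessels_needed(reagent_sets))  # 5 vessels (n + m)
print(len(set(beads)))               # distinct products actually present on beads
```

In this model each bead carries a single reaction history, and the number of possible products grows as the product of the reagent counts while the number of vessels grows only as their sum, which is the economy the passage above describes for very large libraries.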

The use of solid phase techniques in the present invention may also include the use of a
specific encoding technique. Specific encoding techniques have been reviewed by Czarnik
(Czarnik, A.W., Current Opinion in Chemical Biology, 1997, 1, 60). As used in the present
invention, an encoding technique involves the use of a particular "identifying agent" attached to
the solid support, which enables the determination of the structure of a specific library member
without reference to its spatial coordinates. One of ordinary skill in the art will also realize that
if smaller solid phase libraries are generated in specific reaction wells, such as 96 well plates, or
on plastic pins, the reaction history of these library members may also be identified by their
spatial coordinates in the particular plate, and thus are spatially encoded. It is most preferred,
however, for large combinatorial libraries, to use an alternative encoding technique to record the
reaction history.

Examples of particularly preferred alternative encoding techniques that can be utilized in
the present invention include, but are not limited to, spatial encoding techniques, graphical
encoding techniques, including the "tea bag" method, chemical encoding methods, and
spectrophotometric encoding methods. Spatial encoding refers to recording a reaction's history
based on its location. Graphical encoding techniques involve the coding of each synthesis
platform to permit the generation of a relational database. Examples of preferred
spectrophotometric encoding methods include the use of mass spectroscopy, fluorescence
emission, and nuclear magnetic resonance spectroscopy. In a most preferred embodiment,
chemical encoding methods are utilized, which use the structure of the reaction product to code
for its identity. Decoding using this method can be performed on the solid phase or off of the
solid phase. One of ordinary skill in the art will realize that the particular encoding method to
be used in the present invention must be selected based upon the number of library members
desired, and the reaction chemistry employed.

Subsequent characterization of the library members, which can include either the
scaffolds obtained after the biosynthetic pathway or the complex molecules obtained after
diversification using synthetic organic chemistry, can be performed using standard analytical
techniques, such as mass spectrometry, nuclear magnetic resonance spectroscopy, and gas
chromatography. One of ordinary skill in the art will realize that the selection of a particular
analytical technique will depend upon whether the inventive library members are in the solution
phase or on the solid phase.



Reactions at latent functionality in the inventive templates to generate "non-natural" compounds 

Once the inventive templates have been synthesized as discussed above, diversification 
reactions may be employed at each of the different latent functionality sites present in the 
template structures. As mentioned previously, these functionalized templates may then also be 
reintroduced into the enzymatic machinery for further functionalization and structural changes, 
as well as be further functionalized using synthetic organic chemistry, until a desired collection 
of compounds is obtained. One of ordinary skill in the art will appreciate that at any stage the 
reactivity of a particular functionality present in the template structure must be considered when 
selecting particular reagents for diversification. 

Figure 5 depicts specific natural products which are capable of being generated using the 
method of the present invention, or alternatively derivatives of which are also capable of being 
generated using the method of the present invention. Specifically, these structures exemplify the 
variety of sites of latent functionality at which diversification can be achieved using synthetic 
combinatorial techniques. As discussed previously, this can be achieved using solution or solid 
phase methods, using parallel or split-pool techniques. It is particularly preferred, however, to 
utilize solid phase split-pool techniques. Examples of specific reactions to which some or all of 
the systems depicted in Figure 5 can be subjected in solution or on the solid support include, 
but are not limited to, i) addition of nucleophiles (such as primary and secondary amines), ii) 
functionalization of free hydroxyls with electrophiles (for example isocyanates, anhydrides, or 
acid chlorides), iii) opening of epoxides with nucleophiles, such as amines, under ytterbium 
catalysis, iv) functionalization of aromatic rings, specifically functionalization at an aryl iodide 
by conversion to such structures as amines, amides, aromatic rings, alkenes, alkynes, and 
heterocycles using palladium catalyzed chemistry such as Buchwald-Hartwig aminations, Heck 
and Stille couplings, Sonogashira/Castro-Stephens couplings, Suzuki and Stille couplings, and 
carbonylations. Furthermore, resulting aryl alkynes can undergo rhodium-catalyzed 
hydroacylation and azide cycloaddition and nitrone and nitrile oxide cycloadditions. Other 
examples of diversification reactions at potential sites include reactions at amines and amides. 
For example, amides may be functionalized using a Mitsunobu reaction to generate alcohols 
such as straight chain, branched, and cyclic alcohols. 
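
As a rough illustration of how diversification at several sites of latent functionality 
multiplies the number of library members, the following Python sketch enumerates every 
combination of one reagent per site for a single template. The template label, site names and 
reagent names are hypothetical placeholders. The sketch also assumes that every reagent is 
compatible with every other choice, which in practice must be checked against the reactivity of 
each functionality present in the template. 

```python
from itertools import product
from math import prod


def library_size(site_reagents):
    """Number of members when one reagent is chosen independently per site."""
    return prod(len(reagents) for reagents in site_reagents.values())


def enumerate_library(template, site_reagents):
    """Yield one record per library member, with its reaction history.

    Assumption: all reagent combinations are chemically compatible.
    """
    sites = list(site_reagents)
    for combo in product(*(site_reagents[site] for site in sites)):
        yield {"template": template, "history": dict(zip(sites, combo))}


# Hypothetical sites and reagents for one template.
sites = {
    "epoxide": ["amine 1", "amine 2", "amine 3"],
    "hydroxyl": ["isocyanate", "acid chloride"],
    "aryl iodide": ["Suzuki partner", "Sonogashira partner"],
}
members = list(enumerate_library("template T1", sites))
```

With three, two and two reagents at the three sites, the enumeration yields twelve members, each 
carrying the record of which reagent was used at which site. 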

Additionally, for each of the compounds produced by the inventive method, further 
reactions may be employed to attach biomolecules (such as polysaccharides, nucleic acids, or 
peptides) or polymers to appropriate functionalities. 

One of ordinary skill in the art will realize that the above examples are representative of 
reactions that can be used to diversify not only the templates, but also the starter units, of the 
present invention and are not intended to be exclusive. Rather, all equivalents thereof are 
intended to be within the scope of the presently claimed invention. A skilled artisan in the field 
of synthetic organic chemistry will be able to readily identify those reagents capable of reacting 
to create further diversity at selected sites in the inventive template structures and starter units to 
generate compounds and libraries of compounds reminiscent of natural products. The inventive 
method is particularly useful for the generation of such compounds because it incorporates the 
efficiency and creativity of a method for manipulating the enzymatic reactions that produce such 
complex structures in nature with the arsenal of reactions available using synthetic organic 
techniques. For example, the generation of complex templates can be difficult utilizing only 
synthetic organic techniques; however, nature efficiently and elegantly provides these templates. 
Combining this with synthetic organic chemistry enables the use of reactions, such as palladium 
catalyzed reactions, that are not available in nature, and thus the best of both systems can be 
utilized to generate compounds having unprecedented structural, topological, stereochemical 
and functional diversity. 

Uses 

The methods, compounds and libraries generated by the method of the present invention 
can be utilized in various disciplines. For example, the complex molecules generated in the 
method of the present invention may modulate the biological activity of a biological target, such 
as a protein, nucleic acid, lipid or combination thereof. In one preferred embodiment, the 
compounds generated by the method of the invention are utilized in chemical genetics assays to 
alter the activity of such biological molecules. Alternatively or 
additionally, the compounds may be used in in vitro assays, or any other system that allows 
detection of a chemical or biological function. 

In a particularly preferred embodiment of the invention, one or more inventive 
compounds is contacted with a biological target having a detectable biochemical activity. Such 
biological targets include, for example, enzymes, receptors, subunits involved in the formation of 
multimeric complexes. Such multimeric complex subunits may be characterized by catalytic 
capabilities (such as, for example, an ability to catalyze substrate conversion), or may 
alternatively be primarily active in binding to one or more other molecule. The biological target 
can be provided in the form of a purified or semi-purified composition, a cell lysate, a whole cell 
or tissue, or even a whole organism. The level of biochemical activity is detected in the presence 
of the compound, and a statistically significant change in the biochemical activity, relative to the 
level of biochemical activity in the absence of the compound, identifies the compound as a 
modulator, e.g. inhibitor or potentiator of the biological activity of the target protein. In some 
cases, particularly where assays are done on whole cells or organisms, the effect of the chemical 
compound may be to alter the amount, in addition to or instead of the activity, of the particular 
biological target. "Modulators", therefore, are chemical compounds that alter the level or 
activity of a particular target molecule. 
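
The comparison described above, activity in the presence of a compound against activity in its 
absence, can be sketched as a simple two-sample test. The Python example below uses an exact 
permutation test on the difference of mean activity. The choice of test, the 0.05 threshold, the 
function names and the replicate readings are all assumptions made for illustration; the present 
disclosure does not prescribe a particular statistical method. 

```python
from itertools import combinations
from statistics import mean


def permutation_p_value(control, treated):
    """Exact two-sided permutation test on the difference of group means."""
    pooled = list(control) + list(treated)
    n_treated = len(treated)
    n_rest = len(pooled) - n_treated
    total = sum(pooled)
    observed = abs(mean(treated) - mean(control))
    extreme = 0
    count = 0
    # Try every way of relabelling the pooled readings as "treated".
    for idx in combinations(range(len(pooled)), n_treated):
        group_sum = sum(pooled[i] for i in idx)
        diff = abs(group_sum / n_treated - (total - group_sum) / n_rest)
        if diff >= observed - 1e-12:  # small tolerance for rounding
            extreme += 1
        count += 1
    return extreme / count


def classify(control, treated, alpha=0.05):
    """Label a compound from replicate activity readings (hypothetical rule)."""
    p = permutation_p_value(control, treated)
    if p >= alpha:
        return "no significant effect", p
    label = "potentiator" if mean(treated) > mean(control) else "inhibitor"
    return label, p


# Illustrative replicate readings in arbitrary activity units.
without_compound = [100, 98, 103, 101]
with_compound = [52, 49, 55, 50]
label, p = classify(without_compound, with_compound)
```

With four replicates per condition there are 70 possible relabellings, so the smallest 
attainable two-sided p-value is 2/70; fewer replicates could never reach the 0.05 threshold under 
this test, which is one practical reason to run several replicates per compound. 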

In one particularly preferred embodiment of the present invention, multiple compounds 
are assayed simultaneously in a high-throughput format, preferably allowing simultaneous 
analysis of at least 500,000 compounds, preferably at least 1,000,000 compounds, and most 
preferably at least 2,000,000 compounds. 

As discussed above, once a specific desired effect on a biological target has been 
associated with a particular compound of the inventive library, the compounds of the present 
invention may be utilized as a therapeutic agent for a particular medical condition. A 
therapeutic agent for use in the present invention may include any pharmacologically active 
substances that produce a local or systemic effect in animals, preferably mammals, or humans. 
The term thus means any substance intended for use in the diagnosis, cure, mitigation, 
treatment or prevention of disease or in the enhancement of desirable physical or mental 
development and conditions in an animal or human. The therapeutic agent may be administered 
orally, topically or via injection by itself, or additionally may be provided as a pharmaceutical 
composition comprising the therapeutic agent and a biologically acceptable carrier. The 
inventive compositions can be, but are not limited to, aqueous solutions, emulsions, creams, 
ointments, suspensions, gels, and liposomal suspensions. Particularly preferred biologically 
acceptable carriers include but are not limited to water, saline, Ringer's solution, dextrose 
solution and solutions of ethanol, glucose, sucrose, dextran, mannose, mannitol, sorbitol, 
polyethylene glycol (PEG), phosphate, acetate, gelatin, collagen, Carbopol, and vegetable oils. 
It is also possible to include suitable preservatives, stabilizers, antioxidants, antimicrobials, and 
buffering agents, for example including but not limited to BHA, BHT, citric acid, ascorbic acid, 
and tetracycline. The therapeutic agents of the presently claimed invention may also be 
incorporated or encapsulated in a suitable polymer matrix or membrane, thus providing a 
sustained-release delivery device suitable for implantation near the site to be treated locally. 

As one of ordinary skill in the art will realize, the amount of the therapeutic agent 
required to treat any particular disorder will of course vary depending upon the nature and 
severity of the disorder, the age and condition of the subject, and other factors readily 
determined by one of ordinary skill in the art. 

In alternative embodiments, the compounds and libraries of the present invention may 
also be used for the development of cosmetics, food additives, pesticides, and lubricants to name 
a few. Furthermore, the compounds and libraries of the present invention may also be used for 
the development of novel catalysts and materials. For example, the inventive compounds may 
be useful as ligands for transition metal catalysts and the inventive libraries may be useful for the 
rapid identification of novel ligands. These compounds and libraries of compounds may also 
function by acting in concert with a particular transition metal catalyst to effect a particular 
desired chemical reaction. Additionally, the inventive compounds and libraries of compounds 
are also useful in the area of materials science. Because of the reactive moieties present in these 
compounds, molecules such as lipids and other polymeric materials may be attached and thus 
generate potentially important biomaterials. 

One of ordinary skill in the art will realize that the present invention is not intended to be 
limited to the abovementioned uses, but rather may be employed in many contexts and 
disciplines. 